AITopics | Rogers County

Collaborating Authors

Rogers County

NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings

Shachar, Or, Katz, Uri, Goldberg, Yoav, Glickman, Oren

arXiv.org Artificial IntelligenceSep-5-2025

We present NER Retriever, a zero-shot retrieval framework for ad-hoc Named Entity Retrieval, a variant of Named Entity Recognition (NER), where the types of interest are not provided in advance, and a user-defined type description is used to retrieve documents mentioning entities of that type. Instead of relying on fixed schemas or fine-tuned models, our method builds on internal representations of large language models (LLMs) to embed both entity mentions and user-provided open-ended type descriptions into a shared semantic space. We show that internal representations, specifically the value vectors from mid-layer transformer blocks, encode fine-grained type information more effectively than commonly used top-layer embeddings. To refine these representations, we train a lightweight contrastive projection network that aligns type-compatible entities while separating unrelated types. The resulting entity embeddings are compact, type-aware, and well-suited for nearest-neighbor search. Evaluated on three benchmarks, NER Retriever significantly outperforms both lexical and dense sentence-level retrieval baselines. Our findings provide empirical support for representation selection within LLMs and demonstrate a practical solution for scalable, schema-free entity retrieval. The NER Retriever Codebase is publicly available at https://github.com/ShacharOr100/ner_retriever

artificial intelligence, large language model, natural language, (16 more...)

arXiv.org Artificial Intelligence

2509.04011

Country:

Asia > Singapore (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Wyoming (0.04)
(9 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Traveling Words: A Geometric Interpretation of Transformers

Molina, Raul

arXiv.org Artificial IntelligenceSep-18-2023

Transformers have significantly advanced the field of natural language processing, but comprehending their internal mechanisms remains a challenge. In this paper, we introduce a novel geometric perspective that elucidates the inner mechanisms of transformer operations. Our primary contribution is illustrating how layer normalization confines the latent features to a hyper-sphere, subsequently enabling attention to mold the semantic representation of words on this surface. This geometric viewpoint seamlessly connects established properties such as iterative refinement and contextual embeddings. We validate our insights by probing a pre-trained 124M parameter GPT-2 model. Our findings reveal clear query-key attention patterns in early layers and build upon prior observations regarding the subject-specific nature of attention heads at deeper layers. Harnessing these geometric insights, we present an intuitive understanding of transformers, depicting them as processes that model the trajectory of word particles along the hyper-sphere.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2309.07315

Country:

Europe > United Kingdom > Scotland (0.04)
South America > Bolivia (0.04)
North America > United States > Utah > Salt Lake County > Murray (0.04)
(11 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Leisure & Entertainment (0.93)
Health & Medicine (0.67)
Food & Agriculture > Agriculture (0.67)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback